Combining Statistical and Rule-Based Approaches to Morphological Tagging of Czech Texts

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining Statistical and Rule-Based Approaches to Morphological Tagging of Czech Texts

is article is an extract of the PhD thesis (Spoustová, 2007) and it extends the article (Spoustová et al., 2007). Several hybrid disambiguationmethods are describedwhich combine the strength of hand-written disambiguation rules and statistical taggers. ree different statistical taggers (HMM,Maximum-Entropy and Averaged Perceptron) and a large set of hand-written rules are used in a tagging ex...

متن کامل

Combining Rule-Based and Statistical Syntactic Analyzers

This paper presents the results of a set of preliminary experiments combining two knowledge-based partial dependency analyzers with two statistical parsers, applied to the Basque Dependency Treebank. The general idea will be to apply a stacked scheme where the output of the rule-based partial parsers will be given as input to MaltParser and MST, two state of the art statistical parsers. The res...

متن کامل

Rule Induction: Combining Rough Set and Statistical Approaches

In this paper we propose the hybridisation of the rough set concepts and statistical learning theory. We introduce new estimators for rule accuracy and coverage, which base on the assumptions of the statistical learning theory. Then we construct classifier which uses these estimators for rule induction. These estimators allow us to select rules describing statistically significant dependencies ...

متن کامل

Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions

This work presents a part of a more global study on the problem of parsing of Czech and on the knowledge extraction capabilities of the Rule-based method. It is shown that the successfulness of the Rule-based method for English and its unsuccessfulness for Czech, is not only due to the small cardinality of the English tagset (as it is usually claimed) but mainly depends on its structure (”regul...

متن کامل

Statistical modality tagging from rule-based annotations and crowdsourcing

We explore training an automatic modality tagger. Modality is the attitude that a speaker might have toward an event or state. One of the main hurdles for training a linguistic tagger is gathering training data. This is particularly problematic for training a tagger for modality because modality triggers are sparse for the overwhelming majority of sentences. We investigate an approach to automa...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The Prague Bulletin of Mathematical Linguistics

سال: 2008

ISSN: 1804-0462,0032-6585

DOI: 10.2478/v10108-009-0002-x